Sequence Listing was accepted. 

If you need help call the Patent Electronic Business Center at (866) 
217-9197 (toll free) . 
Reviewer : markspencer 

Timestamp: [year=2009; month=9; day=17; hr=14; min=2 6 ; sec=31; ms = 7 37 ; ] 



Validated By CRFValidator v 1.0.3 
Application No: 10572638 Version No: 



2.0 



Input Set: 



Output Set: 



Started: 2009-08-14 12:10:31.800 

Finished: 2009-08-14 12:10:45.660 

Elapsed: 0 hr(s) 0 min(s) 13 sec(s) 860 ms 

Total Warnings : 5 

Total Errors: 0 

No. of SeqIDs Defined: 321 

Actual SeqID Count: 321 

Error code Error Description 

W 251 Found intentionally skipped sequence in SEQID (27 ) 

W 251 Found intentionally skipped sequence in SEQID (149 ) 

W 213 Artificial or Unknown found in <213> in SEQ ID (305) 

W 213 Artificial or Unknown found in <213> in SEQ ID (306) 

W 213 Artificial or Unknown found in <213> in SEQ ID (307) 



SEQUENCE LISTING 

<110> HAYNES , Barton F. 
GAO, Feng 
KORBER, Bette T. 
HAHN, Beatrice H. 
SHAW, George M . 
KOTHE, DENISE 
LI, Ying Ying 
DECKER, Julie 
LIAO, Hua-Xin 

<12 0> CONSENSUS /ANCESTRAL IMMUNOGENS 

<130> 01579-1093 

<140> 10572638 
<141> 2006-12-22 



<150> PCT/US2004/03097 
<151> 2004-09-17 

<150> 60/604, 722 
<151> 2004-08-27 

<150> 60/503, 460 
<151> 2003-09-17 

<160> 321 

<170> Patentln Ver. 3.3 

<210> 1 
<211> 866 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 1 

Met Arg Val Met Gly lie Gin Arg Asn Cys Gin His Leu Trp Arg Trp 
15 10 15 

Gly Thr Met lie Leu Gly Met Leu Met lie Cys Ser Ala Ala Glu Asn 
20 25 30 

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Asn 
35 40 45 

Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 
50 55 60 

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65 70 75 80 



Gin Glu lie Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys 
85 90 95 



Asn Asn Met Val Glu Gin Met His Glu Asp lie lie Ser Leu Trp Asp 
100 105 110 



Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

Asn Cys Thr Asn Val Arg Asn Val Ser Ser Asn Gly Thr Glu Thr Asp 
130 135 140 

Asn Glu Glu lie Lys Asn Cys Ser Phe Asn lie Thr Thr Glu Leu Arg 
145 150 155 160 

Asp Lys Lys Gin Lys Val Tyr Ala Leu Phe Tyr Arg Leu Asp Val Val 
165 170 175 

Pro lie Asp Asp Lys Asn Ser Ser Glu lie Ser Gly Lys Asn Ser Ser 
180 185 190 

Glu Tyr Tyr Arg Leu lie Asn Cys Asn Thr Ser Ala lie Thr Gin Ala 
195 200 205 

Cys Pro Lys Val Ser Phe Glu Pro lie Pro lie His Tyr Cys Ala Pro 
210 215 220 

Ala Gly Phe Ala lie Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr 
225 230 235 240 

Gly Pro Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly lie Lys 
245 250 255 

Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu 

260 265 270 

Glu lie lie lie Arg Ser Glu Asn lie Thr Asn Asn Ala Lys Thr lie 

275 280 285 

lie Val Gin Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn 
290 295 300 

Asn Asn Thr Arg Lys Ser lie His lie Gly Pro Gly Gin Ala Phe Tyr 
305 310 315 320 

Ala Thr Gly Glu lie lie Gly Asp lie Arg Gin Ala His Cys Asn lie 

325 330 335 

Ser Arg Thr Lys Trp Asn Lys Thr Leu Gin Gin Val Ala Lys Lys Leu 
340 345 350 

Arg Glu His Phe Asn Asn Lys Thr lie lie Phe Lys Pro Ser Ser Gly 
355 360 365 

Gly Asp Leu Glu lie Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe 
370 375 380 



Phe Tyr Cys Asn Thr Ser Gly Leu Phe Asn Ser Thr Trp Met Phe Asn 
385 390 395 400 



Gly Thr Tyr Met Phe Asn Gly Thr Lys Asp Asn Ser Glu Thr lie Thr 
405 410 415 

Leu Pro Cys Arg lie Lys Gin lie lie Asn Met Trp Gin Gly Val Gly 
420 425 430 

Gin Ala Met Tyr Ala Pro Pro lie Glu Gly Lys lie Thr Cys Lys Ser 
435 440 445 

Asn lie Thr Gly Leu Leu Leu lie Thr Arg Asp Gly Gly Asn Asn Ser 
450 455 460 

Asn Lys Asn Lys Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg 
465 470 475 480 

Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys lie Glu 
485 490 495 

Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Glu Arg 
500 505 510 

Glu Lys Arg Ala Val Gly lie Gly Ala Val Phe Leu Gly Phe Leu Gly 
515 520 525 

Ala Ala Gly Ser Thr Met Gly Ala Ala Ser lie Thr Leu Thr Val Gin 
530 535 540 

Ala Arg Gin Leu Leu Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu 
545 550 555 560 

Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly 

565 570 575 

lie Lys Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys 
580 585 590 

Asp Gin Gin Leu Leu Gly lie Trp Gly Cys Ser Gly Lys Leu lie Cys 
595 600 605 

Thr Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Gin Asp 
610 615 620 

Glu lie Trp Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu lie Ser 

625 630 635 640 

Asn Tyr Thr Asp lie lie Tyr Arg Leu lie Glu Glu Ser Gin Asn Gin 
645 650 655 

Gin Glu Lys Asn Glu Gin Glu Leu Leu Ala Leu Asp Lys Ala Ser Leu 
660 665 670 

Trp Asn Trp Phe Asp lie Thr Asn Trp Leu Trp Tyr lie Lys lie Phe 
675 680 685 

lie Met lie Val Gly Gly Leu lie Gly Leu Arg lie Val Phe Ala Val 
690 695 700 



Leu Ser lie Val 
705 

Gin Thr Leu lie 



Glu Glu Glu Gly 
740 

Asn Gly Phe Leu 
755 

Phe Ser Tyr His 
770 

Val Glu Leu Leu 
785 

Glu Ala Leu Lys 



Leu Lys Asn Ser 
820 

Ala Glu Gly Thr 

835 

Ala lie Leu Asn 
850 



Asn Arg Val Arg 
710 

Pro Asn Pro Arg 
725 

Gly Glu Gin Gly 



Ala Leu Ala Trp 
760 

Arg Leu Arg Asp 
775 

Gly Arg Arg Ser 
790 

Tyr Leu Gly Asn 
805 

Ala lie Ser Leu 



Asp Arg Val lie 
840 

lie Pro Arg Arg 
855 



Gin Gly Tyr Ser 
715 

Gly Pro Asp Arg 
730 

Arg Asp Arg Ser 
745 

Asp Asp Leu Arg 



Phe lie Leu lie 
780 

Leu Arg Gly Leu 
795 

Leu Leu Gin Tyr 
810 

Leu Asp Thr Thr 

825 

Glu lie Val Gin 



lie Arg Gin Gly 
860 



Pro Leu Ser Phe 
720 

Pro Glu Gly lie 

735 

lie Arg Leu Val 
750 

Ser Leu Cys Leu 
765 

Ala Ala Arg Thr 



Gin Lys Gly Trp 
800 

Trp Gly Gin Glu 
815 

Ala lie Ala Val 
830 

Arg Ala Cys Arg 
845 

Leu Glu Arg Ala 



Leu Leu 
865 



<210> 2 

<211> 2607 

<212> DNA 

<213> Human immunodeficiency virus 



<400> 2 

gccaccatgc gcgtgatggg catccagcgc aactgccagc acctgtggcg ctggggcacc 60 

atgatcctgg gcatgctgat gatctgctcc gccgccgaga acctgtgggt gaccgtgtac 120 

tacggcgtgc ccgtgtggaa ggaggccaac accaccctgt tctgcgcctc cgacgccaag 180 

gcctacgaca ccgaggtgca caacgtgtgg gccacccacg cctgcgtgcc caccgacccc 240 

aacccccagg agatcgtgct ggagaacgtg accgagaact tcaacatgtg gaagaacaac 300 

atggtggagc agatgcacga ggacatcatc tccctgtggg accagtccct gaagccctgc 360 

gtgaagctga cccccctgtg cgtgaccctg aactgcacca acgtgcgcaa cgtgtcctcc 420 

aacggcaccg agaccgacaa cgaggagatc aagaactgct ccttcaacat caccaccgag 480 

ctgcgcgaca agaagcagaa ggtgtacgcc ctgttctacc gcctggacgt ggtgcccatc 540 

gacgacaaga actcctccga gatctccggc aagaactcct ccgagtacta ccgcctgatc 600 

aactgcaaca cctccgccat cacccaggcc tgccccaagg tgtccttcga gcccatcccc 660 

atccactact gcgcccccgc cggcttcgcc atcctgaagt gcaacgacaa gaagttcaac 720 

ggcaccggcc cctgcaagaa cgtgtccacc gtgcagtgca cccacggcat caagcccgtg 780 

gtgtccaccc agctgctgct gaacggctcc ctggccgagg aggagatcat catccgctcc 840 

gagaacatca ccaacaacgc caagaccatc atcgtgcagc tgaacgagtc cgtggagatc 900 

aactgcaccc gccccaacaa caacacccgc aagtccatcc acatcggccc cggccaggcc 960 

ttctacgcca ccggcgagat catcggcgac atccgccagg cccactgcaa catctcccgc 1020 



accaagtgga acaagaccct gcagcaggtg gccaagaagc tgcgcgagca cttcaacaac 1080 

aagaccatca tcttcaagcc ctcctccggc ggcgacctgg agatcaccac ccactccttc 1140 

aactgcggcg gcgagttctt ctactgcaac acctccggcc tgttcaactc cacctggatg 1200 

ttcaacggca cctacatgtt caacggcacc aaggacaact ccgagaccat caccctgccc 1260 

tgccgcatca agcagatcat caacatgtgg cagggcgtgg gccaggccat gtacgccccc 1320 

cccatcgagg gcaagatcac ctgcaagtcc aacatcaccg gcctgctgct gacccgcgac 1380 

ggcggcaaca actccaacaa gaacaagacc gagaccttcc gccccggcgg cggcgacatg 1440 

cgcgacaact ggcgctccga gctgtacaag tacaaggtgg tgaagatcga gcccctgggc 1500 

gtggccccca ccaaggccaa gcgccgcgtg gtggagcgcg agaagcgcgc cgtgggcatc 1560 

ggcgccgtgt tcctgggctt cctgggcgcc gccggctcca ccatgggcgc cgcctccatc 1620 

accctgaccg tgcaggcccg ccagctgctg tccggcatcg tgcagcagca gtccaacctg 1680 

ctgcgcgcca tcgaggccca gcagcacctg ctgcagctga ccgtgtgggg catcaagcag 1740 

ctgcaggccc gcgtgctggc cgtggagcgc tacctgaagg accagcagct gctgggcatc 1800 

tggggctgct ccggcaagct gatctgcacc accaacgtgc cctggaactc ctcctggtcc 1860 

aacaagtccc aggacgagat ctgggacaac atgacctgga tggagtggga gcgcgagatc 192 0 

tccaactaca ccgacatcat ctaccgcctg atcgaggagt cccagaacca gcaggagaag 1980 

aacgagcagg agctgctggc cctggacaag tgggcctccc tgtggaactg gttcgacatc 2040 

accaactggc tgtggtacat caagatcttc atcatgatcg tgggcggcct gatcggcctg 2100 

cgcatcgtgt tcgccgtgct gtccatcgtg aaccgcgtgc gccagggcta ctcccccctg 2160 

tccttccaga ccctgatccc caacccccgc ggccccgacc gccccgaggg catcgaggag 2220 

gagggcggcg agcagggccg cgaccgctcc atccgcctgg tgaacggctt cctggccctg 2280 

gcctgggacg acctgcgctc cctgtgcctg ttctcctacc accgcctgcg cgacttcatc 2340 

ctgatcgccg cccgcaccgt ggagctgctg ggccgccgct ccctgcgcgg cctgcagaag 2400 

ggctgggagg ccctgaagta cctgggcaac ctgctgcagt actggggcca ggagctgaag 2460 

aactccgcca tctccctgct ggacaccacc gccatcgccg tggccgaggg caccgaccgc 2520 

gtgatcgaga tcgtgcagcg cgcctgccgc gccatcctga acatcccccg ccgcatccgc 2580 

cagggcctgg agcgcgccct gctgtaa 2 607 



<210> 3 
<211> 2535 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 3 

gccgccatgc gcgtgatggg catcctgcgc aactgccagc agtggtggat ctggggcatc 60 

ctgggcttct ggatgctgat gatctgctcc gtggtgggca acctgtgggt gaccgtgtac 120 

tacggcgtgc ccgtgtggaa ggaggccaag accaccctgt tctgcgcctc cgacgccaag 180 

gcctacgagc gcgaggtgca caacgtgtgg gccacccacg cctgcgtgcc caccgacccc 240 

aacccccagg agatggtgct ggagaacgtg accgagaact tcaacatgtg gaagaacgac 300 

atggtggacc agatgcacga ggacatcatc tccctgtggg accagtccct gaagccctgc 360 

gtgaagctga cccccctgtg cgtgaccctg aactgcacca acgtgaccaa cgccaccaac 420 

aacacctaca acggcgagat gaagaactgc tccttcaaca tcaccaccga gctgcgcgac 480 

aagaagaaga aggagtacgc cctgttctac cgcctggaca tcgtgcccct gaacgagaac 540 

tcctccgagt accgcctgat caactgcaac acctccgcca tcacccaggc ctgccccaag 600 

gtgtccttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 660 

tgcaacaaca agaccttcaa cggcaccggc ccctgcaaca acgtgtccac cgtgcagtgc 720 

acccacggca tcaagcccgt ggtgtccacc cagctgctgc tgaacggctc cctggccgag 780 

gaggagatca tcatccgctc cgagaacctg accgacaacg ccaagaccat catcgtgcag 840 

ctgaacgagt ccgtggagat cgtgtgcacc cgccccaaca acaacacccg caagtccatg 900 

cgcatcggcc ccggccagac cttctacgcc accggcgaca tcatcggcga catccgccag 960 

gcccactgca acatctccga ggacaagtgg aacaagaccc tgcagcaggt ggccgagaag 1020 

ctgggcaagc acttccccaa caagaccatc accttcgagc cctcctccgg cggcgacctg 1080 

gagatcacca cccactcctt caactgccgc ggcgagttct tctactgcaa cacctccaag 1140 

ctgttcaact ccacctacaa caacaacacc aactccaact ccaccatcac cctgccctgc 1200 

cgcatcaagc agatcatcaa catgtggcag ggcgtgggcc aggccatgta cgcccccccc 1260 

atcgccggca acatcacctg caagtccaac atcaccggcc tgctgctgac ccgcgacggc 1320 



ggcaaggaga acaccaccga gaccttccgc cccggcggcg gcgacatgcg cgacaactgg 1380 

cgctccgagc tgtacaagta caaggtggtg gagatcaagc ccctgggcgt ggcccccacc 1440 

gaggccaagc gccgcgtggt ggagcgcgag aagcgcgccg tgggcctggg cgccgtgttc 1500 

ctgggcttcc tgggcgccgc cggctccacc atgggcgccg cctccatcac cctgaccgtg 1560 

caggcccgcc agctgctgtc cggcatcgtg cagcagcagt ccaacctgct gcgcgccatc 1620 

gaggcccagc agcacatgct gcagctgacc gtgtggggca tcaagcagct gcaggcccgc 1680 

gtgctggcca tggagcgcta cctgaaggac cagcagctgc tgggcatctg gggctgctcc 1740 

ggcaagctga tctgcaccac cgccgtgccc tggaactcct cctggtccaa caagtccctg 1800 

gacgacatct gggacaacat gacctggatg gagtgggacc gcgagatctc caactacacc 1860 

gacaccatct accgcctgct ggaggagtcc cagaaccagc aggagaagaa cgagcaggac 192 0 

ctgctggccc tggactcctg ggagaacctg tggaactggt tcgacatcac caactggctg 1980 

tggtacatca agatcttcat catgatcgtg ggcggcctga tcggcctgcg catcatcttc 2040 

gccgtgctgt ccatcgtgaa ccgcgtgcgc cagggctact cccccctgtc cttccagacc 2100 

ctgaccccca acccccgcgg ccccgaccgc ctggagcgca tcgaggagga gggcggcgag 2160 

caggaccgcg accgctccat ccgcctggtg tccggcttcc tggccctggc ctgggacgac 2220 

ctgcgctccc tgtgcctgtt ctcctaccac cgcctgcgcg acttcatcct gatcgccgcc 2280 

cgcaccgtgg agctgctggg ccgctcctcc ctgcgcggcc tgcagcgcgg ctgggaggcc 2340 

ctgaagtacc tgggctccct ggtgcagtac tggggccagg agctgaagaa gtccgccatc 2400 

tccctgctgg acaccatcgc catcgccgtg gccgagggca ccgaccgcat catcgaggtg 2460 

gtgcagcgcg cctgccgcgc catcctgaac atcccccgcc gcatccgcca gggcttcgag 2520 

gccgccctgc tgtaa 2535 



<210> 4 
<211> 2535 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 4 

gccgccatgc gcgtgatggg catcctgcgc aactgccagc agtggtggat ctggggcatc 60 

ctgggcttct ggatgctgat gatctgcaac gtggtgggca acctgtgggt gaccgtgtac 120 

tacggcgtgc ccgtgtggaa ggaggccaag accaccctgt tctgcgcctc cgacgccaag 180 

gcctacgaga aggaggtgca caacgtgtgg gccacccacg cctgcgtgcc caccgacccc 240 

aacccccagg agatggtgct ggagaacgtg accgagaact tcaacatgtg gaagaacgac 300 

atggtggacc agatgcacga ggacatcatc tccctgtggg accagtccct gaagccctgc 360 

gtgaagctga cccccctgtg cgtgaccctg aactgccgca acgtgaccaa cgccaccaac 420 

aacacctaca acgaggagat caagaactgc tccttcaaca tcaccaccga gctgcgcgac 480 

aagaagaaga aggtgtacgc cctgttctac cgcctggaca tcgtgcccct gaacgagaac 540 

tcctccgagt accgcctgat caactgcaac acctccgcca tcacccaggc ctgccccaag 600 

gtgtccttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 660 

tgcaacaaca agaccttcaa cggcaccggc ccctgcaaca acgtgtccac cgtgcagtgc 720 

acccacggca tcaagcccgt ggtgtccacc cagctgctgc tgaacggctc cctggccgag 780 

gaggagatca tcatccgctc cgagaacctg accaacaacg ccaagaccat catcgtgcac 840 

ctgaacgagt ccgtggagat cgtgtgcacc cgccccaaca acaacacccg caagtccatc 900 

cgcatcggcc ccggccagac cttctacgcc accggcgaca tcatcggcga catccgccag 960 

gcccactgca acatctccga ggacaagtgg aacaagaccc tgcagcgcgt gtccaagaag 1020 

ctgaaggagc acttccccaa caagaccatc aagttcgagc cctcctccgg cggcgacctg 1080 

gagatcacca cccactcctt caactgccgc ggcgagttct tctactgcaa cacctccaag 1140 

ctgttcaact ccacctacaa caacaacacc aactccaact ccaccatcac cctgccctgc 1200 

cgcatcaagc agatcatcaa catgtggcag gaggtgggcc gcgccatgta cgcccccccc 1260 

atcgccggca acatcacctg caagtccaac atcaccggcc tgctgctgac ccgcgacggc 1320 

ggcaagaaga acaccaccga gatcttccgc cccggcggcg gcgacatgcg cgacaactgg 1380 

cgctccgagc tgtacaagta caaggtggtg gagatcaagc ccctgggcgt ggcccccacc 1440 

aaggccaagc gccgcgtggt ggagcgcgag aagcgcgccg tgggcatcgg cgccgtgttc 1500 

ctgggcttcc tgggcgccgc cggctccacc atgggcgccg cctccatcac cctgaccgtg 1560 

caggcccgcc agctgctgtc cggcatcgtg cagcagcagt ccaacctgct gcgcgccatc 1620 

gaggcccagc agcacatgct gcagctgacc gtgtggggca tcaagcagct gcagacccgc 1680 



gtgctggcca tcgagcgcta cctgaaggac cagcagctgc tgggcatctg gggctgctcc 1740 

ggcaagctga tctgcaccac cgccgtgccc tggaactcct cctggtccaa caagtcccag 1800 

gaggacatct gggacaacat gacctggatg cagtgggacc gcgagatctc caactacacc 1860 

gacaccatct accgcctgct ggaggactcc cagaaccagc aggagaagaa cgagaaggac 192 0 

ctgctggccc tggactcctg gaagaacctg tggaactggt tcgacatcac caactggctg 1980 

tggtacatca agatcttcat catgatcgtg ggcggcctga tcggcctgcg catcatcttc 2040 

gccgtgctgt ccatcgtgaa ccgcgtgcgc cagggctact cccccctgtc cttccagacc 2100 

ctgaccccca acccccgcgg ccccgaccgc ctgggccgca tcgaggagga gggcggcgag 2160 

caggaccgcg accgctccat ccgcctggtg tccggcttcc tggccctggc ctgggacgac 2220 

ctgcgctccc tgtgcctgtt ctcctaccac cgcctgcgcg acttcatcct ggtggccgcc 2280 

cgcgccgtgg agctgctggg ccgctcctcc ctgcgcggcc tgcagcgcgg ctgggaggcc 2340 

ctgaagtacc tgggctccct ggtgcagtac tggggcctgg agctgaagaa gtccgccatc 2400 

tccctgctgg acaccatcgc catcgccgtg gccgagggca ccgaccgcat catcgagctg 2460 

atccagcgca tctgccgcgc catccgcaac atcccccgcc gcatccgcca gggcttcgag 2520 

gccgccctgc agtaa 2535 



<210> 5 
<211> 842 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 5 

Met Arg Val Met Gly lie Leu Arg Asn Cys Gin Gin Trp Trp lie Trp 



